Using Tweets to Help Sentence Compression for News Highlights Generation
نویسندگان
چکیده
We explore using relevant tweets of a given news article to help sentence compression for generating compressive news highlights. We extend an unsupervised dependency-tree based sentence compression approach by incorporating tweet information to weight the tree edge in terms of informativeness and syntactic importance. The experimental results on a public corpus that contains both news articles and relevant tweets show that our proposed tweets guided sentence compression method can improve the summarization performance significantly compared to the baseline generic sentence compression method.
منابع مشابه
Utilizing Microblogs for Automatic News Highlights Extraction
Story highlights form a succinct single-document summary consisting of 3-4 highlight sentences that reflect the gist of a news article. Automatically producing news highlights is very challenging. We propose a novel method to improve news highlights extraction by using microblogs. The hypothesis is that microblog posts, although noisy, are not only indicative of important pieces of information ...
متن کاملUsing Relevant Public Posts to Enhance News Article Summarization
A news article summary usually consists of 2-3 key sentences that reflect the gist of that news article. In this paper we explore using public posts following a new article to improve automatic summary generation for the news article. We propose different approaches to incorporate information from public posts, including using frequency information from the posts to re-estimate bigram weights i...
متن کاملUncovering Topic Dynamics of Social Media and News: The Case of Ferguson
Looking at the dynamics of news content and social media content can help us understand the increasingly complex dynamics of the relationship between the media and the public surrounding noteworthy news events. Although topic models such as latent Dirichlet allocation (LDA) are valuable tools, they are a poor fit for analyses in which some documents, like news articles, tend to incorporate mult...
متن کاملTGSum: Build Tweet Guided Multi-Document Summarization Dataset
The development of summarization research has been significantly hampered by the costly acquisition of reference summaries. This paper proposes an effective way to automatically collect large scales of news-related multi-document summaries with reference to social media’s reactions. We utilize two types of social labels in tweets, i.e., hashtags and hyper-links. Hashtags are used to cluster doc...
متن کاملSentence Compression for Dutch Using Integer Linear Programming
Sentence compression is a valuable task in the framework of text summarization. In this paper we compress sentences from news articles taken from Dutch and Flemish newspapers using an integer linear programming approach. We rely on the Alpino parser available for Dutch and on the Latent Words Language Model. We demonstrate that the integer linear programming approach yields good results for com...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015